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The 2002-3 pandemic caused by severe acute respiratory syndrome 
coronavirus (SARS-CoV) was one of the most significant public health 
events in recent history’. An ongoing outbreak of Middle East respira- 
tory syndrome coronavirus’ suggests that this group of viruses remains 
a key threat and that their distribution is wider than previously recog- 
nized. Although bats have been suggested to be the natural reservoirs 
of both viruses*°, attempts to isolate the progenitor virus of SARS- 
CoV from bats have been unsuccessful. Diverse SARS-like corona- 
viruses (SL-CoVs) have now been reported from bats in China, 
Europe and Africa’ *, but none is considered a direct progenitor 
of SARS-CoV because of their phylogenetic disparity from this virus 
and the inability of their spike proteins to use the SARS-CoV cellular 
receptor molecule, the human angiotensin converting enzyme II 
(ACE2)”"°. Here we report whole-genome sequences of two novel bat 
coronaviruses from Chinese horseshoe bats (family: Rhinolophidae) 
in Yunnan, China: RsSSHC014 and Rs3367. These viruses are far more 
closely related to SARS-CoV than any previously identified bat coro- 
naviruses, particularly in the receptor binding domain of the spike 
protein. Most importantly, we report the first recorded isolation of 
a live SL-CoV (bat SL-CoV-WIV1) from bat faecal samples in Vero 
E6 cells, which has typical coronavirus morphology, 99.9% sequence 
identity to Rs3367 and uses ACE2 from humans, civets and Chinese 
horseshoe bats for cell entry. Preliminary in vitro testing indicates 
that WIV] also has a broad species tropism. Our results provide the 
strongest evidence to date that Chinese horseshoe bats are natural 
reservoirs of SARS-CoV, and that intermediate hosts may not be 
necessary for direct human infection by some bat SL-CoVs. They also 
highlight the importance of pathogen-discovery programs targeting 
high-risk wildlife groups in emerging disease hotspots as a strategy 
for pandemic preparedness. 

The 2002-3 pandemic of SARS' and the ongoing emergence of the 
Middle East respiratory syndrome coronavirus (MERS-CoV)” demon- 
strate that CoVs are a significant public health threat. SARS-CoV was 
shown to use the human ACE2 molecule as its entry receptor, and this 
is considered a hallmark of its cross-species transmissibility'’. The receptor 
binding domain (RBD) located in the amino-terminal region (amino 
acids 318-510) of the SARS-CoV spike (S) protein is directly involved 
in binding to ACE2 (ref. 12). However, despite phylogenetic evidence 
that SARS-CoV evolved from bat SL-CoVs, all previously identified 
SL-CoVs have major sequence differences from SARS-CoV in the RBD 
of their S proteins, including one or two deletions®’. Replacing the RBD 
of one SL-CoV S protein with SARS-CoV S conferred the ability to use 
human ACE2 and replicate efficiently in mice”’’. However, to date, no 
SL-CoVs have been isolated from bats, and no wild-type SL-CoV of bat 
origin has been shown to use ACE2. 

We conducted a 12-month longitudinal survey (April 2011-September 
2012) of SL-CoVs ina colony of Rhinolophus sinicus at a single location 


in Kunming, Yunnan Province, China (Extended Data Table 1). A total 
of 117 anal swabs or faecal samples were collected from individual bats 
using a previously published method”. A one-step reverse transcrip- 
tion (RT)-nested PCR was conducted to amplify the RNA-dependent 
RNA polymerase (RdRP) motifs A and C, which are conserved among 
alphacoronaviruses and betacoronaviruses””. 

Twenty-seven of the 117 samples (23%) were classed as positive by 
PCR and subsequently confirmed by sequencing. The species origin of 
all positive samples was confirmed to be R. sinicus by cytochrome b 
sequence analysis, as described previously’®. A higher prevalence was 
observed in samples collected in October (30% in 2011 and 48.7% in 
2012) than those in April (7.1% in 2011) or May (7.4% in 2012) (Extended 
Data Table 1). Analysis of the S protein RBD sequences indicated the 
presence of seven different strains of SL-CoVs (Fig. la and Extended 
Data Figs 1 and 2). In addition to RBD sequences, which closely matched 
previously described SL-CoVs (Rs672, Rf1 and HKU3)*"”"*, two novel 
strains (designated SL-CoV RsSHC014 and Rs3367) were discovered. 
Their full-length genome sequences were determined, and both were 
found to be 29,787 base pairs in size (excluding the poly(A) tail). The 
overall nucleotide sequence identity of these two genomes with human 
SARS-CoV (Tor2 strain) is 95%, higher than that observed previously 
for bat SL-CoVs in China (88-92%)**'”"* or Europe (76%)° (Extended 
Data Table 2 and Extended Data Figs 3 and 4). Higher sequence iden- 
tities were observed at the protein level between these new SL-CoVs 
and SARS-CoVs (Extended Data Tables 3 and 4). To understand the 
evolutionary origin of these two novel SL-CoV strains, we conducted 
recombination analysis with the Recombination Detection Program 
4.0 package” using available genome sequences of bat SL-CoV strains 
(Rfl, Rp3, Rs672, Rm1, HKU3 and BM48-31) and human and civet 
representative SARS-CoV strains (BJ01, SZ3, Tor2 and GZ02). Three 
breakpoints were detected with strong P values (<10 *”) and supported 
by similarity plot and bootscan analysis (Extended Data Fig. 5a, b). Break- 
points were located at nucleotides 20,827, 26,553 and 28,685 in the 
Rs3367 (and RsSsHC014) genome, and generated recombination frag- 
ments covering nucleotides 20,827-26,533 (5,727 nucleotides) (inclu- 
ding partial open reading frame (ORF) 1b, full-length S, ORF3, E and 
partial M gene) and nucleotides 26,534-28,685 (2,133 nucleotides) 
(including partial ORF M, full-length ORF6, ORF7, ORF8 and partial 
N gene). Phylogenetic analysis using the major and minor parental regions 
suggested that Rs3367, or RSSHCO14, is the descendent ofa recombination 
of lineages that ultimately lead to SARS-CoV and SL-CoV Rs672 (Fig. 1b). 

The most notable sequence differences between these two new SL- 
CoVs and previously identified SL-CoVs is in the RBD regions of their 
S proteins. First, they have higher amino acid sequence identity to SARS- 
CoV (85% and 96% for RSSHC014 and Rs3367, respectively). Second, 
there are no deletions and they have perfect sequence alignment with 
the SARS-CoV RBD region (Extended Data Figs 1 and 2). Structural 
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Figure 1 | Phylogenetic tree based on amino acid sequences of the S RBD 
region and the two parental regions of bat SL-CoV Rs3367 or RsSsHCO014. 
a, SARS-CoV S protein amino acid residues 310-520 were aligned with 
homologous regions of bat SL-CoVs using the ClustalW software. A maximum- 
likelihood phylogenetic tree was constructed using a Poisson model with 
bootstrap values determined by 1,000 replicates in the MEGAS software package. 
The RBD sequences identified in this study are in bold and named by the sample 
numbers. The key amino acid residues involved in interacting with the human 
ACE2 molecule are indicated on the right of the tree. SARS-CoV GZ02, BJO1 and 
Tor2 were isolated from patients in the early, middle and late phase, respectively, 
of the SARS outbreak in 2003. SARS-CoV SZ3 was identified from Paguma 
larvata in 2003 collected in Guangdong, China. SL-CoV Rp3, Rs672 and HKU3-1 
were identified from R. sinicus collected in China (respectively: Guangxi, 2004; 
Guizhou, 2006; Hong Kong, 2005). Rfl and Rm1 were identified from 


and mutagenesis studies have previously identified five key residues 
(amino acids 442, 472, 479, 487 and 491) in the RBD of the SARS-CoV 
S protein that have a pivotal role in receptor binding”’*’. Although all 
five residues in the RsSHC014 S protein were found to be different 
from those of SARS-CoV, two of the five residues in the Rs3367 RBD 
were conserved (Fig. 1 and Extended Data Fig. 1). 

Despite the rapid accumulation of bat CoV sequences in the last 
decade, there has been no report of successful virus isolation®**”’. We 
attempted isolation from SL-CoV PCR-positive samples. Using an 
optimized protocol and Vero E6 cells, we obtained one isolate which 
caused cytopathic effect during the second blind passage. Purified virions 
displayed typical coronavirus morphology under electron microscopy 
(Fig. 2). Sequence analysis using a sequence-independent amplifica- 
tion method” to avoid PCR-introduced contamination indicated that 
the isolate was almost identical to Rs3367, with 99.9% nucleotide genome 
sequence identity and 100% amino acid sequence identity for the S1 
region. The new isolate was named SL-CoV-WIV1. 
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R. ferrumequinum and R. macrotis, respectively, collected in Hubei, China, in 
2004. Bat SARS-related CoV BM48-31 was identified from R. blasii collected in 
Bulgaria in 2008. Bat CoV HKU9-1 was identified from Rousettus leschenaultii 
collected in Guangdong, China in 2005/2006 and used as an outgroup. All 
sequences in bold and italics were identified in the current study. Filled triangles, 
circles and diamonds indicate samples with co-infection by two different 
SL-CoVs. ‘—’ indicates the amino acid deletion. b, Phylogenetic origins of the two 
parental regions of Rs3367 or RSSHC014. Maximum likelihood phylogenetic 
trees were constructed from alignments of two fragments covering nucleotides 
20,827-26,533 (5,727 nucleotides) and 26,534 —28,685 (2,133 nucleotides) of the 
Rs3367 genome, respectively. For display purposes, the trees were midpoint 
rooted. The taxa were annotated according to strain names: SARS-CoV, SARS 
coronavirus; SARS-like CoV, bat SARS-like coronavirus. The two novel SL-CoVs, 
Rs3367 and RsSHC014, are in bold and italics. 


To determine whether WIV1 can use ACE2 as a cellular entry receptor, 
we conducted virus infectivity studies using HeLa cells expressing or 
not expressing ACE2 from humans, civets or Chinese horseshoe bats. 
We found that WIV1 is able to use ACE2 of different origins as an entry 
receptor and replicated efficiently in the ACE2-expressing cells (Fig. 3). 
This is, to our knowledge, the first identification of a wild-type bat SL- 
CoV capable of using ACE2 as an entry receptor. 

To assess its cross-species transmission potential, we conducted infec- 
tivity assays in cell lines from a range of species. Our results (Fig. 4 and 
Extended Data Table 5) indicate that bat SL-CoV-WIV1 can grow in 
human alveolar basal epithelial (A549), pig kidney 15 (PK-15) and 
Rhinolophus sinicus kidney (RSKT) cell lines, but not in human cervix 
(HeLa), Syrian golden hamster kidney (BHK21), Myotis davidii kidney 
(BK), Myotis chinensis kidney (MCKT), Rousettus leschenaulti kidney 
(RLK) or Pteropus alecto kidney (PaKi) cell lines. Real-time RT-PCR 
indicated that WIV1 replicated much less efficiently in A549, PK-15 
and RSKT cells than in Vero E6 cells (Fig. 4). 
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Figure 2 | Electron micrograph of purified virions. Virions from a 10-ml 
culture were collected, fixed and concentrated/purified by sucrose gradient 
centrifugation. The pelleted viral particles were suspended in 100 ul PBS, 
stained with 2% phosphotungstic acid (pH 7.0) and examined directly using a 
Tecnai transmission electron microscope (FEI) at 200 kV. 


To assess the cross-neutralization activity of human SARS-CoV sera 
against WIV1, we conducted serum-neutralization assays using nine 
convalescent sera from SARS patients collected in 2003. The results 
showed that seven of these were able to completely neutralize 100 tissue 
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Figure 3 | Analysis of receptor usage of SL-CoV-WIV1 determined by 
immunofluorescence assay and real-time PCR. Determination of virus 
infectivity in HeLa cells with and without the expression of ACE2. b, bat; 

c, civet; h, human. ACE2 expression was detected with goat anti-humanACE2 
antibody followed by fluorescein isothiocyanate (FITC)-conjugated donkey 
anti-goat IgG. Virus replication was detected with rabbit antibody against the 
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culture infectious dose 50 (TCID;,.) WIV1 at dilutions of 1:10 to 1:40, 
further confirming the close relationship between WIV1 and SARS-CoV. 

Our findings have important implications for public health. First, 
they provide the clearest evidence yet that SARS-CoV originated in bats. 
Our previous work provided phylogenetic evidence of this’, but the lack 
of an isolate or evidence that bat SL-CoVs can naturally infect human 
cells, until now, had cast doubt on this hypothesis. Second, the lack of 
capacity of SL-CoVs to use of ACE2 receptors has previously been 
considered as the key barrier for their direct spillover into humans, suppor- 
ting the suggestion that civets were intermediate hosts for SARS-CoV 
adaptation to human transmission during the SARS outbreak”. However, 
the ability of SL-CoV-WIV1 to use human ACE2 argues against the 
necessity of this step for SL-CoV-WIVI1 and suggests that direct bat- 
to-human infection is a plausible scenario for some bat SL-CoVs. This 
has implications for public health control measures in the face of poten- 
tial spillover ofa diverse and growing pool of recently discovered SARS- 
like CoVs with a wide geographic distribution. 

Our findings suggest that the diversity of bat CoVs is substantially 
higher than that previously reported. In this study we were able to demon- 
strate the circulation of at least seven different strains of SL-CoVs within a 
single colony of R. sinicus during a 12-month period. The high genetic 
diversity of SL-CoVs within this colony was mirrored by high pheno- 
typic diversity in the differential use of ACE2 by different strains. It 
would therefore not be surprising if further surveillance reveals a broad 
diversity of bat SL-CoVs that are able to use ACE2, some of which may 
have even closer homology to SARS-CoV than SL-CoV-WIV1. Our 
results—in addition to the recent demonstration of MERS-CoV in a 
Saudi Arabian bat’, and of bat CoVs closely related to MERS-CoV in 
China, Africa, Europe and North America’*®*’—suggest that bat coro- 
naviruses remain a substantial global threat to public health. 

Finally, this study demonstrates the public health importance of path- 
ogen discovery programs targeting wildlife that aim to identify the ‘known 
unknowns —previously unknown viral strains closely related to known 
pathogens. These programs, focused on specific high-risk wildlife groups 
and hotspots of disease emergence, may be a critical part of future global 
strategies to predict, prepare for, and prevent pandemic emergence”. 
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SL-CoV Rp3 nucleocapsid protein followed by cyanine 3 (Cy3)-conjugated 
mouse anti-rabbit IgG. Nuclei were stained with DAPI (4’,6-diamidino-2- 
phenylindole). The columns (from left to right) show staining of nuclei (blue), 
ACE2 expression (green), virus replication (red), merged triple-stained 
images and real-time PCR results, respectively. (n = 3); error bars represent 
standard deviation. 
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Figure 4 | Analysis of host range of SL-CoV-WIV1 determined by 
immunofluorescence assay and real-time PCR. Virus infection in A549, 
RSKT, Vero E6 and PK-15 cells. Virus replication was detected as described for 
Fig. 3. The columns (from left to right) show staining of nuclei (blue), virus 
replication (red), merged double-stained images and real-time PCR results, 
respectively. n = 3; error bars represent s.d. 


METHODS SUMMARY 


Throat and faecal swabs or fresh faecal samples were collected in viral transport 
medium as described previously™*. All PCR was conducted with the One-Step RT- 
PCR kit (Invitrogen). Primers targeting the highly conserved regions of the RdRP 
gene were used for detection of all alphacoronaviruses and betacoronaviruses as 
described previously’’. Degenerate primers were designed on the basis of all avail- 
able genomic sequences of SARS-CoVs and SL-CoVs and used for amplification of 
the RBD sequences of S genes or full-length genomic sequences. Degenerate primers 
were used for amplification of the bat ACE2 gene as described previously’. PCR 
products were gel purified and cloned into pGEM-T Easy Vector (Promega). At 
least four independent clones were sequenced to obtain a consensus sequence. PCR- 
positive faecal samples (in 200 pl buffer) were gradient centrifuged at 3,000-12,000g 
and supernatant diluted at 1:10 in DMEM before being added to Vero E6 cells. After 
incubation at 37 °C for 1h, inocula were removed and replaced with fresh DMEM 
with 2% FCS. Cells were incubated at 37 °C and checked daily for cytopathic effect. 
Cell lines from different origins were grown on coverslips in 24-well plates and 
inoculated with the novel SL-CoV at a multiplicity of infection of 10. Virus repli- 
cation was detected at 24h after infection using rabbit antibodies against the SL- 
CoV Rp3 nucleocapsid protein followed by Cy3-conjugated goat anti-rabbit IgG. 


Online Content Any additional Methods, Extended Data display items and Source 
Data are available in the online version of the paper; references unique to these 
sections appear only in the online paper. 
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METHODS 


Sampling. Bats were trapped in their natural habitat as described previously”. 
Throat and faecal swab samples were collected in viral transport medium (VTM) 
composed of Hank’s balanced salt solution, pH 7.4, containing BSA (1%), ampho- 
tericin (15 tg ml”), penicillin G (100 U ml‘) and streptomycin (50 Lg ml” '). To 
collect fresh faecal samples, clean plastic sheets measuring 2.0 by 2.0 m were placed 
under known bat roosting sites at about 18:00 h each evening. Relatively fresh faecal 
samples were collected from sheets at approximately 05:30-06:00 the next morning 
and placed in VIM. Samples were transported to the laboratory and stored at 
—80°C until use. All animals trapped for this study were released back to their 
habitat after sample collection. All sampling processes were performed by veter- 
inarians with approval from Animal Ethics Committee of the Wuhan Institute of 
Virology (WIVH05210201) and EcoHealth Alliance under an inter-institutional 
agreement with University of California, Davis (UC Davis protocol no. 16048). 
RNA extraction, PCR and sequencing. RNA was extracted from 140 pl of swab 
or faecal samples with a Viral RNA Mini Kit (Qiagen) following the manufacturer's 
instructions. RNA was eluted in 60 pl RNAse-free buffer (buffer AVE, Qiagen), 
then aliquoted and stored at —80 °C. One-step RT-PCR (Invitrogen) was used to 
detect coronavirus sequences as described previously’’. First round PCR was con- 
ducted in a 25-l reaction mix containing 12.5 ul PCR 2X reaction mix buffer, 
10 pmol of each primer, 2.5 mM MgSO4, 20 U RNase inhibitor, 1 pl SuperScript 
I/ Platinum Taq Enzyme Mix and 5 ul RNA. Amplification of the RdRP-gene frag- 
ment was performed as follows: 50 °C for 30 min, 94 °C for 2 min, followed by 40 
cycles consisting of 94°C for 15s, 62 °C for 15 s, 68 °C for 40s, and a final exten- 
sion of 68 °C for 5 min. Second round PCR was conducted in a 25-l reaction mix 
containing 2.5 ul PCR reaction buffer, 5 pmol of each primer, 5|0mM MgCh, 
0.5mM dNTP, 0.1 pl Platinum Taq Enzyme (Invitrogen) and 1 ul first round 
PCR product. The amplification of RdRP-gene fragment was performed as fol- 
lows: 94 °C for 5 min followed by 35 cycles consisting of 94 °C for 30s, 52 °C for 
30s, 72 °C for 40s, and a final extension of 72 °C for 5 min. 

To amplify the RBD region, one-step RT-PCR was performed with primers 
designed based on available SARS-CoV or bat SL-CoVs (first round PCR primers; 
F, forward; R, reverse: CoVS931F-5'-VWGADGTTGTKAGRTTYCCT-3’ and 
CoVS1909R-5’-TAARACA VCCWGCYTGWGT-3’; second PCR primers: CoVS 
951F-5'-TGTKAGRTTYCCTAAYATTAC-3’ and CoVS1805R-5’-ACATCYTG 
ATANARAACAGC-3’), First-round PCR was conducted in a 25-ul reaction mix 
as described above except primers specific for the S gene were used. The ampli- 
fication of the RBD region of the S gene was performed as follows: 50 °C for 30 min, 
94°C for 2 min, followed by 35 cycles consisting of 94°C for 15s, 43 °C for 15s, 
68 °C for 90s, and a final extension of 68 °C for 5 min. Second-round PCR was 
conducted in a 25-ul reaction mix containing 2.5 pl PCR reaction buffer, 5 pmol of 
each primer, 50 mM MgCl, 0.5 mM dNTP, 0.1 pl Platinum Taq Enzyme (Invitrogen) 
and 1 ul first round PCR product. Amplification was performed as follows: 94 °C 
for 5 min followed by 40 cycles consisting of 94 °C for 30 s, 41 °C for 30 s, 72 °C for 
60s, and a final extension of 72 °C for 5 min. 

PCR products were gel purified and cloned into pGEM-T Easy Vector (Promega). 
At least four independent clones were sequenced to obtain a consensus sequence 
for each of the amplified regions. 

Sequencing full-length genomes. Degenerate coronavirus primers were designed 
based on all available SARS-CoV and bat SL-CoV sequences in GenBank and specific 
primers were designed from genome sequences generated from previous rounds of 
sequencing in this study (primer sequences will be provided upon request). All 
PCRs were conducted using the One-Step RT-PCR kit (Invitrogen). The 5’ and 3’ 
genomic ends were determined using the 5’ or 3’ RACE kit (Roche), respectively. 
PCR products were gel purified and sequenced directly or following cloning into 
pGEM-T Easy Vector (Promega). At least four independent clones were sequenced 
to obtain a consensus sequence for each of the amplified regions and each region 
was sequenced at least twice. 

Sequence analysis and databank accession numbers. Routine sequence manage- 
ment and analysis was carried out using DNAStar or Geneious. Sequence align- 
ment and editing was conducted using ClustalW, BioEdit or GeneDoc. Maximum 
Likelihood phylogenetic trees based on the protein sequences were constructed 
using a Poisson model with bootstrap values determined by 1,000 replicates in the 
MEGAS software package. 

Sequences obtained in this study have been deposited in GenBank as follows 
(accession numbers given in parenthesis): full-length genome sequence of SL-CoV 
RsSHC014 and Rs3367 (KC881005, KC881006); full-length sequence of WIV1 S 
(KC881007); RBD (KC880984-KC881003); ACE2 (KC8810040). SARS-CoV 
sequences used in this study: human SARS-CoV strains Tor2 (AY274119), BJO1 
(AY278488), GZ02 (AY390556) and civet SARS-CoV strain SZ3 (AY304486). Bat 
coronavirus sequences used in this study: Rs672 (FJ588686), Rp3 (DQ071615), Rfl 
(DQ412042), Rm1 (DQ412043), HKU3-1 (DQ022305), BM48-31 (NC_014470), 
HKU9-1 (NC_009021), HKU4 (NC_009019), HKU5 (NC_009020), HKU8 (DQ249228), 
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HKU2 (EF203067), BtCoV512 (NC_009657), 1A (NC_010437). Other coronavirus 
sequences used in this study: HCoV-229E (AF304460), HCoV-OC43 (AY391777), 
HCoV-NL63 (AY567487), HKU1 (NC_006577), EMC (JX869059), FIPV (NC_002306), 
PRCV (DQ811787), BWCoV (NC_010646), MHV (AY700211), IBV (AY851295). 
Amplification, cloning and expression of the bat ACE2 gene. Construction of 
expression clones for human and civet ACE2 in pcDNA3.1 has been described 
previously’. Bat ACE2 was amplified from a R. sinicus (sample no. 3357). In brief, 
total RNA was extracted from bat rectal tissue using the RNeasy Mini Kit (Qiagen). 
First-strand complementary DNA was synthesized from total RNA by reverse trans- 
cription with random hexamers. Full-length bat ACE2 fragments were amplified 
using forward primer bAF2 and reverse primer bAR2 (ref. 29). The ACE2 gene was 
cloned into pCDNA3.1 with KpnI and Xhol, and verified by sequencing. Purified 
ACE2 plasmids were transfected to HeLa cells. After 24 h, lysates of HeLa cells 
expressing human, civet, or bat ACE2 were confirmed by western blot or immu- 
nofluorescence assay. 

Western blot analysis. Lysates of cells or filtered supernatants containing pseu- 
doviruses were separated by SDS-PAGE, followed by transfer to a nitrocellulose 
membrane (Millipore). For detection of S protein, the membrane was incubated 
with rabbit anti-Rp3 S fragment (amino acids 561-666) polyantibodies (1:200), 
and the bound antibodies were detected by alkaline phosphatase (AP)-conjugated 
goat anti-rabbit IgG (1:1,000). For detection of HIV-1 p24 in supernatants, mono- 
clonal antibody against HIV p24 (p24 MAb) was used as the primary antibody at a 
dilution of 1:1,000, followed by incubation with AP-conjugated goat anti-mouse IgG 
at the same dilution. To detect the expression of ACE2 in HeLa cells, goat antibody 
against the human ACE2 ectodomain (1:500) was used as the first antibody, followed 
by incubation with horseradish peroxidase-conjugated donkey anti-goat IgG (1:1,000). 
Virus isolation. Vero E6 cell monolayers were maintained in DMEM supplemen- 
ted with 10% FCS. PCR-positive samples (in 200 pl buffer) were gradient centri- 
fuged at 3,000-12,000g, and supernatant were diluted 1:10 in DMEM before being 
added to Vero E6 cells. After incubation at 37 °C for 1 h, inocula were removed and 
replaced with fresh DMEM with 2% FCS. Cells were incubated at 37 °C for 3 days 
and checked daily for cytopathic effect. Double-dose triple antibiotics penicillin/ 
streptomycin/amphotericin (Gibco) were included in all tissue culture media (peni- 
cillin 200IU ml’, streptomycin 0.2 mg ml‘, amphotericin 0.5 1g ml"). Three 
blind passages were carried out for each sample. After each passage, both the culture 
supernatant and cell pellet were examined for presence of virus by RT-PCR using 
primers targeting the RdRP or S gene. Virions in supernatant (10 ml) were collected 
and fixed using 0.1% formaldehyde for 4h, then concentrated by ultracentrifuga- 
tion through a 20% sucrose cushion (5 ml) at 80,000g for 90 min using a Ty90 rotor 
(Beckman). The pelleted viral particles were suspended in 100 pl PBS, stained with 
2% phosphotungstic acid (pH 7.0) and examined using a Tecnai transmission 
electron microscope (FEI) at 200 kV. 

Virus infectivity detected by immunofluorescence assay. Cell lines used for this 
study and their culture conditions are summarized in Extended Data Table 5. Virus 
titre was determined in Vero E6 cells by cytopathic effect (CPE) counts. Cell lines 
from different origins and HeLa cells expressing ACE2 from human, civet or Chinese 
horseshoe bat were grown on coverslips in 24-well plates (Corning) incubated with 
bat SL-CoV-WIV1 at a multiplicity of infection = 10 for 1h. The inoculum was 
removed and washed twice with PBS and supplemented with medium. HeLa cells 
without ACE2 expression and Vero E6 cells were used as negative and positive 
controls, respectively. At 24h after infection, cells were washed with PBS and fixed 
with 4% formaldehyde in PBS (pH 7.4) for 20 min at 4°C. ACE2 expression was 
detected using goat anti-human ACE2 immunoglobulin (R&D Systems) followed 
by FITC-labelled donkey anti-goat immunoglobulin (PTGLab). Virus replication 
was detected using rabbit antibody against the SL-CoV Rp3 nucleocapsid protein 
followed by Cy3-conjugated mouse anti-rabbit IgG. Nuclei were stained with DAPI. 
Staining patterns were examined using a FV1200 confocal microscope (Olympus). 
Virus infectivity detected by real-time RT-PCR. Vero E6, A549, PK15, RSKT 
and HeLa cells with or without expression of ACE2 of different origins were inocu- 
lated with 0.1 TCIDs5, WIV-1 and incubated for 1 h at 37 °C. After removing the 
inoculum, the cells were cultured with medium containing 1% FBS. Supernatants 
were collected at 0, 12, 24 and 48h. RNA from 140 ul of each supernatant was 
extracted with the Viral RNA Mini Kit (Qiagen) following manufacturer’s instruc- 
tions and eluted in 60 pl buffer AVE (Qiagen). RNA was quantified on the ABI 
StepOne system, with the TaqMan AgPath-ID One-Step RT-PCR Kit (Applied 
Biosystems) in a 25 pl reaction mix containing 4 ul RNA, 1 X RT-PCR enzyme 
mix, 1 X RT-PCR buffer, 40 pmol forward primer (5’-GTGGTGGTGACGGCA 
AAATG-3’), 40 pmol reverse primer (5'-AAGTGAAGCTTCTGGGCCAG-3’) 
and 12 pmol probe (5’-FAM-AAAGAGCTCAGCCCCAGATG-BHQ1-3’). Ampli- 
fication parameters were 10 min at 50 °C, 10 min at 95 °C and 50 cycles of 15 sat 95 °C 
and 20s at 60 °C. RNA dilutions from purified WIV-1 stock were used as a standard. 
Serum neutralization test. SARS patient sera were inactivated at 56 °C for 30 min 
and then used for virus neutralization testing. Sera were diluted starting with 1:10 
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and then serially twofold diluted in 96-well cell plates to 1:40. Each 100 pl serum 
dilution was mixed with 100 ul viral supernatant containing 100 TCID5y of WIV1 
and incubated at 37 °C for 1 h. The mixture was added in triplicate wells of 96-well 
cell plates with plated monolayers of Vero E6 cells and further incubated at 37 °C 
for 2 days. Serum from a healthy blood donor was used as a negative control in 
each experiment. CPE was observed using an inverted microscope 2 days after 
inoculation. The neutralizing antibody titre was read as the highest dilution of 
serum which completely suppressed CPE in infected wells. The neutralization test 
was repeated twice. 

Recombination analysis. Full-length genomic sequences of SL-CoV Rs3367 or 
RsSHC014 were aligned with those of selected SARS-CoVs and bat SL-CoVs using 
Clustal X. The aligned sequences were preliminarily scanned for recombination 


events using Recombination Detection Program (RDP) 4.0 (ref. 19). The potential 
recombination events suggested by RDP owing to their strong P values (<10-20) 
were investigated further by similarity plot and bootscan analyses implemented in 
Simplot 3.5.1. Phylogenetic origin of the major and minor parental regions of 
Rs3367 or RsSHCO14 were constructed from the concatenated sequences of the 
essential ORFs of the major and minor parental regions of selected SARS-CoV and 
SL-CoVs. Two genome regions between three estimated breakpoints (20,827- 
26,553 and 26,554—28,685) were aligned independently using ClustalX and gene- 
rated two alignments of 5,727 base pairs and 2,133 base pairs. The two alignments 
were used to construct maximum likelihood trees to better infer the fragment 
parents. All nucleotide numberings in this study are based on Rs3367 genome 
position. 
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IISNCVADYSVEYNES TFS TFKCYGVSAITK LINDLCFSNVYADSFVVK@DVROIAPGOTGVIADYNYKLPDDFMGCVLAWNT| 
IISNCVADYSVELYNES TFS TFKRCYGVSAATK LINDLCFSNVYADSFVVKEMDVROIAPGOTGVIADYNYKLPDDFMGCVLAWNT| 
IISNCVADYSVLYNEISTFSTFKCYGVSAITK LINDLCFSNVYADSFVVK@@DVROIAPGOTGVIADYNYKLPDDFMGCVLAWNT 
IISNCVADYSVELYNESWTSFSTFKCYGVSAITK LINDLCFSNV YADSFVVK@M@DVROIAPGOTGVIADYNYKLPDDFMGCVLAWNT 
IISNCVADYSVLYNESTSFSTFKCYGVSAATK LINDLCFSNVYADSFVVK@M@DVROIAPGOTGVIADYNYKLPDDFMIGCVLAWNT, 
IISNCVADYSVLYNEISTSFSTFKCYGVSAITK LINDLCFSNVYADSFVVK@MIDVROIAPGOTGVIADYNYKLPDDFMG CVLAWNT 
IISNCVADYSVEYNEISTSFSTFRCYGVSMTK LINDLCFSNVYADSFVVKEM@DVROIAPGOTGVIADYNYKLPDDFMGCVLAWNT 
IISNCVADY THBAYNEISITS FSTFKCYGVSPSKLMEDLCFTSVYADTFLI REBIEVROVAPGETGVIADYNYKLPDDFMIGCVIAWNT 
IISDCVADYTVLYNEISTSFSTFKCYGVSHSKLESDLCFTSVYAD@FLI REBJEVROVAPGETGVIADYNYKLPDDFMIGCVIAWNT 
IITSDCVADYTVEYNEISTSFSTFRKCYGVSPSK LESDLCFTSVYADTFLI REBIEVROVAPGETGVIADYNYKLPDDFMIGCVIAWNT, 
ITSDCVADYTVLYNEISTSFSTFRCYGVSPASK LESDLCFTSVYADTFLI REBIEVROVAPGETGVIADYNYKLPDDFMIGCVIAWNT 
IISDCVADYTVLYNEISTSFSTFKRCYGVS9SK LESDLCFTSVYAD@FLIREBJEVROVAPGETGVIADYNYKLPDDFMIGCVIAWNT, 
ITSDCVADY TVBRYNEISITS FS TFINCYGVSIRSK LEBDLCFTSVYADTFLI REBSEVROVAPGOTGVIADYNYKLPDDFMIGCVIAWNT, 
ISDCVADYTWVLYNEISTSFSTFRCYGVSHSK LEBDLCFTSVYADTFLIREBIEVROVAPGETGVIADYNYKLPDDFMNGCVIAWNT| 
ISDCVADYTVLYNBSTSFSTFRCYGVSHSKLESDLCFTSVYADTFLIRSEBJEVROVAPGETGVIADYNYKLPDDFMIGCVIAWNT| 
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RNIPBATS TEN YNWKBRIY LR CRS PFS PDGKPCTPPALNCYW PRIN DNIGBHY TT TCHIch®) > RRA Ae 
RNIPMATS TEN YNBIKREAY LR CR MS PFS PDGKPCTPPALINC YW PRIN DICH Y TT TCHCE4)? RRA Aaa 
RNIPBATS TRIN YNWKBRIY LR CRESS PFS PDGKPCTPPALMNCYW PRIN DNIGHY TT TcHich®) > RRR Ae 
RNIPBATS TREN YNWKBRIY LR CRESS PFS PDGKPCTPPALNCYW PRK DMICGBHYTTs Cich®) > RRA Aaa 
RNIMATOTEN YNWKREAS LRH CRESS PFS PDGKPCTPPAFNCYW PRIN DICHYI TN CHICEO? ERRATA 
NSKBsSs TSEN YN LBRAW VR SE ERAS ET YS PGGOS CSAVG PIC YN PRR PRICE FTTACHCER? RRA ae 
NSKBsSS TSEN YN LBRAW VR SE BASIN I YS PGGOS CSAVG Pic YN PR PRICE FTTACWCER? RRA ae 


Ak Ofg- - - V@- - SHIFEAIs HRs S SEARS E -- -[NGVR THIS TRIDEIN PN V PRIDE) TEARS 
AK Of- - - Of8- - OB/YBAAs SINK TEMSESSOAEWSID FE - - - ---------- NGVR THIS TRDBY PN V PRERA TREATS 
AK Of- - - TR - - YR YBAAs HK TRESS D DG - - - --------- Nc V Y THIS TRIDEIN PN V PAAR), TEARS SAS 
AK Of- - - Of - - OR YBEIs SRIK TEERISESSSEORAES D 5: -- -[NGVR THIS TRIDBY PTV PH ER)A TERRA eS) 
AK Of)- - - Of8 - - OB YBAAs SIXK TRESS D FE -- -INGVR THIS TRIDEY PN V PHERS®)A TEEAAABIES SAS 
AK Of- - - V@- - SRFRAAs HAs SKSSASSJ EE - - - - --------- NGVR THIS TRDEN ONV PRERG)A TEARS CS) 
AK Of- - - O@ - - OM YBAAs HK THKESEOASSJD FE - - - - --------- NGVR THIS TRIDBY PSV PHAR), TRAAABTISE AAAS 
AQOb- - - off - - OB YREAs VINK ERS D & Nic Vv Y THIS TRDBY PSI PERG), TRAAABIESA-S) 
AK H)- - - T- - NBEVYBAAs HK TEOASSD DG - - - --------- Nc V Y THIS TRIDEIN PN V PAAR), TEARS AAS 


NS LB- ---SSN ER PRAIR FRE CRE C RESIN L FN PSGGTCSAEGLINCYK PHA SBiCHTOSS Cilc I? RRA 


ATV CG PKIES TWLT KNOCVNFEN FNGIDMNG TGV L THIS S KR F OB FOOFGRDMSDFTDSVRDPK TRJBILDISPCSFGGVSVITPGTN| 
ATV CG PKIES THILT KNOQCVNFEN FNGIDMNG TGV L THIS SKRF OM FOQOFGRDMSDFTDSVRDPK TBI LDISPCSFGGVSVITPGTN| 
ATV CG PKIGS TWJLT KNOCVNFEN FNG/DMNG/TG V L THIS S KR FON FOOFGRDMSDFTDSVRDPK TRBI LDISPCSFGGVSVITPGTN| 
A TV CG PKIES TRILVKNOCVNFENFNGDING TGV LTWSSKRFORFOOFGRDMSDFTDSVRDPOTMBI LDIETPCSFGGVSVITPGTN| 
ATV CG PKIES TAALVKNOCVNFEN FNGIDIGGHEG V LTS SKRF ORJFOOFGR DMS DF TDSVRDPQTMOILDIETPCSFGGVSVITPGCTN| 
ATV CG PKS TRILVKNOCVNFEN FNGIDINGIEG V LTS SKRF ORF OOFGR DMS DF TDSVRDPOTMEILDIZPCSFGGVSVITPGTN| 
ATV CG PKIES TEELVKNOCVNFNFNGDINGTGVLTWSSKRFORFOOFGRDMSDFTDSVRDPQTMOILDIETPCSFGGVSVITPGTN| 
ATV CG PKES TEILVKNOCVNFEN FNGIDINGIEGV LTWSSKRF ORF OOFGR DMS DFTDSVRDPOTMOILDIZPCSFGGVSVITPGTN| 
A TV CG PKIES TRILVKNOCVNFEN FNGBBNG TGV LTS S KU F ORF OOFGR DAS DFTDSVRDPOTMBMI LDIS PCS FGGVSVITPGTN| 
ATV CG PKIGS TRILVKNOCVNFN FNGIDING TGV L THIS SKRF ORJFOOFGRDMS DF TDSVRDPQTMBILDISPCSFGGVSVITPGTN| 
A TV CG PKIES TRILVKNOCVNFEN FNGIDIG/EG VL TMS S KR F OB/F OOFGR DS DF TDSVRDPQTMEI LDISPCSFGGVSVITPGTN| 
IA TV CG PKIES THRELVKNOCVNFEN FNGIDING/TG V L TRIS S KR F OR/F OOFGR DMS DF TDSVRDPOQTMEI LDIS PCSFGGVSVITPGTN| 
ATV CG PKI@S TRLVKNKCVN EN FNGIDMIG/TG V L Thy WF OOFGRDMSDFTDSVRDPK TMEI LDIFNPCSYGGVSVITPGTN| 


Extended Data Figure 1 | Sequence alignment of CoV S protein RBD. indicated with a bold vertical line on the left. The key amino acid residues 
SARS-CoV S protein (amino acids 310-520) is aligned with homologous involved in the interaction with human ACE2 are numbered on the top of the 
regions of bat SL-CoVs using ClustalW. The newly discovered bat SL-CoVs are aligned sequences. 
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Human SARS-CoV GZ02 ME La LP acne TORS BODe DIM CRMREOONCOEN Ditbau\O Heys SMIRG VY YIIDE FR SDBNLY LTO DSF L DEM SN V TRF Ee Tp GA DN PV TEI i@DG LY PAA TEK SNVURGUV FG] 
Human SARS-CoV BJO1 Ean Cg UP acne TORS BOD DM CRMEEOONCOEN PAULO ews SHIRG VY YIADE FR SDBNLY LTO DF L DFMSNV TSF ie TEMA DN PV IE rigDG 1Y PAA TEK SNVURGUWV FG| 
Human SARS-CoV Tor2 Mpg ig Up cee CBcleRcode DM CHEB NTOEN PbavIO betyS SMRG VY YPIDE [FR SDINLY LTO DMF L DFMSN V TREE Tp) MEMBERS EN PV IFi@DG [Y FAA TEKSNVURGWV FG] 
Civet SARS-CoV Sz3 Mpa Tig UP acne TORS BOD DM CMMEBOONCOEN PaO ews SHIRG VY YIADE FR SDBNLY LTO DEL DFM@SNV TRF ie Tp eeMGA? DN PV IE rigDG 1Y FAA TEK SNVURGNV FG| 
Bat SL-CoV Rs3367 Mice LAY LN Ao ACESS ad a CORON) We DIM LEAS SRG VY YIRDD IFR SN\MUHLMO DEIFL PEMSN V TSF iT peeeeO DN PL TF i@DG LY FAA TEKSNV IRGWV FG| 
M\@88 LAY Leo Acne ane cae CIB ODD) ais PENRO PMS SIIRG VY Yp2DD IFR SN\ALHLMO DSF L PFSNV TF Bl T ener DN PL is haDG I1YFAATEKSNVIRGWV FG| 
Bat SL-CoV RS672 MisguaV 54 1) Sales CBRE NO) Dede CeRceacbe de PHUBEK [arcs SMRG VY YMDDIFR SD\QUHL TO D4FL DFM IN LTMY Wop MneE Mead? DN OBE rel>G 1Y FAA TEKSNVVRGWIFG| 
Bat SL-CoV Rfl Midis Cpt 1 oe eRe Ne) Seis CEMMEIEE GNe PL@M\O (ESS SARGVY YMDD IFR SDMUMLTO D4FL DFASN LI@Y iS) eSOM@eMBa? DN PLUG FeIDGVY FAA TEKSNVERGWV FG| 
Bat SL-CoV Rm1 Mig 5) 1 See EE Nae) eRe CSB@MSS dae PLUS. (ASS SARGVY YMDD IFR SDMUMLTO D4FL DFMSN LT@Y iS) eeiem@ba? DN PLL FeDGVY FAA TEKSNVERGWIEG| 
Bat SL-CoV Rp3 Midge Up) [RRM Nee) HRs COMME S20) PVN IS SURG VY YMDD IFR SN\QUHL TO D4FL PFRSN LT@Y BS) \MMOESORBA? DN DI FeIDCV Y FAA TEKSNVIRGWIFG, 
Bat SL-CoV HKU3-1 Mids Ib 1 oO EE NG) [ee COB@eCE de) DIGENO SS SURG VY YMDD IER SD\QUHLTO D4FL PEMSN LT MY BS) MO MEBM@ed? DN DI UM re>GVY FAA TEKSNVIRGWIFG| 
Bat SARS-related CoV BM48-31 MK-PA FLcLBc raNaofic Kf TL SN Ks Psk U1) 7 SS: Bel ae > Ss Te VC AY OD sic KOR TY MEN By Gc BE: - a Ee 


Human SARS-CoV GZ02 ISTRINNEESO SIMI ENN STNV V IRINCNE BIG QO PRE EV BE@eMCROBSETIY LF RINA EN CT PEMIRSNA FEL DIBSSKEC NF KEIR EBV FKNIDG FLMVY [egy ie | DVL PEs EN BIL 
Human SARS-CoV BJ01 STRINNIASO SIMI TENN STNV V IRIN PIU Gey PRT EV Peet OBST CF RIVA EN CT PEMIBOA FEL DIRKECNFKEUR GEV FXNKDG FUMVY [egy Oe | DVIS MELD EG EN BIL 
Human SARS-CoV Tor2 STRNNIASO SIMI TENN STNVV IRANI Gey PRT EV Pe@eneR OBST CF RIVA EN CT PEMIRGA FEL DIEBKECNFKEDR GEV FXNKDG FUMVY [egy ae | DVIEMEL DEG EN BLL 
Civet_SARS-CoV_Sz3 STRNNIASO SIMI TENN STNV V IRAN Din Gey PRT EV Ewen OBEY CF RINA EN CT PEMIBOA FELDISSKECNFKEDR GEV FXNK DG FUMVY (egy ae | DVIEMEL DEG EN BLL 


IS TWIN Nii SO SMT IMNN STN LV TREACN FED Cla) PIG FLV SSO Bea Sod T PAIN A EN CT PESAV igs Fal | Dige@ok eC NF KIGR EBV FRNK DG FURY ely SB: Teste LPs EN IAL) 
IS TWIN Nii SO SMT IMNN STN LV TREACN FED lah) Pi FLV WSO Bea Sod T PAIN A EN CT PEBV age Fal | Dige@oK ESC NF KIGR EBV FRNK DG FOBVY [ely SB: Tite LPs EN TAL) 
IS TDNBYTO SENT IVNN STHI 1 TRAMCHAPIEL Ciao DULY U8 | Ren Base SIV Y SNA YN CTY DISV poe] S Fe] L Div BcOhiG NF KILR EYV FKNK DG POBSVY RENY Gb: T Diese LDS EB SWAL| 
IS THDNBATO SHAT IVNN STHI TIRY jai BN iMS« DME - Sac TOK SEW BANGS: 7 &« E> T's *N TER 7 SS © Mo TT AR CR Sg. Gy 
|S Sg DN#¥ TO SINT IVNN ST HI I IRMCN FIML Cia Dilly Mv ErcademnO ek SIV Y 8]SA EN CTY DINV ia SF's) LDewWek biG NFKIQLR EYV FKNK(@G FLINVY May peNV N Lposdepa PINs ESWC. | 

a io DIMIY iV EuersptevNO OB SIV Y 8]SA EN CTY Didv pag S F's) L Dew-WekK biG NF KLR EYV FKNR DG PLBSVY OMY pwAV N Lpo-deL Dias FST | 
Bat SL-CoV HKU3-1 IS SHPDNMATO SHAV IVNN STHI 1 TRAMCN FIBD Cia PUY WV BiSatemsO SBAIIVY 8] SA EN CTY DISV baad S Fa] | DMS BIC NF KIDR EVV FKNR DG FLBSVY (May WAV N Giese LPG ESWAL| 
Bat SARS-related CoV BM48 -3 1 Bagi THe @r Nic ReaD v GN FA DBA - - sGCo PY K Bw RAAT BN. Wee - AHERN Bs TN MN PK GH FERS HV SSE Tg HN YE PRN SG FR) SESRV i 


Human SARS-CoV GZ02 KP KL As IN TN ERE 1 LEON OO ERA AY FVGIELKEEDDEM Uy DENG TIMDAV DCSON PLAE LK CSWK SFMIMKG IY QTSN FRVJPISMHDV VR FPN I TNMCP PREV, 
Human SARS-CoV BJO1 Fc IL Figs NCTM ON OA A eV GIL EE DEM Uy DENG TI BDAV DCSON FLABLK CSWK SPBIBKG 1 QT SN PRUMPSOV VR FPN INCE FEIOV 
Human SARS-CoV Tor2 cP KL Ags INT TNR | LEO OOM EMEA AY F VCP KITTEN Uy DENG TIMDAV DCSON PLAE LK CSWK SFBIMKG 1YQTSN FRVMMPSEDV VR FPNT INMCPFEEV, 
Civet SARS-CoV SZ3 cD KL as TT NERA LEON OO EMME AY PVCU KTIMEM Liv DENG TIMDAV DCSON PLAE LK CSWK SFMIMKG IYO TSN FRVMMPSEDV VR FPNT INMCPFEEV, 


cP IK L Ps IN TT ER AL LEON See EMSA AY FVGHMLKEEPTEM Uy DENG TITDAV DC SON PLAELK CSMK SFMIMKG IVOTSN FRVMPSMEVVRFPNI INMCPFEEV| 
cP IK L os INT TERA LEON Eee EMSA AY FVGMILKESTTEM Uy DENG TITDAV DC SON PLAELK CSMK SFMIMKG IVOTSN FRVMPSMEVVRFPNI INMCPFEEV| 
PML ee TN TT SEK QV" ee Ee FPA AY FVGMLKPRSTEN LMFNENG TITDA 1DCAQN PLSELK CTEKNFMV EKG IYOTSN FRVEPIMEV IR FPNT INGCP FBKV 
PIM L Pgs IN TT SERV" Ml ERR PISA AYY VCMLK@ISTEN [EIFNONG TIMDAV DC SODPLAELK CTMK SFMVEKG IYOTSN FRVEPIMNEV VR FPNT INMMCP FBKV 
RP IML ge IN TT SURV" El EOE PMSA AYY VGMLK ATTEN [BFNENG TIMDAV DC SON PLAELK CTAKNEMVEKG IYOTSN FRVMPIIMEV VR FPN INGCP FBKV 
RP IMKL ge IN TT SURV" EOE FPOSA AYY VGMLK ATTEN [EFNENG TIMNA I DCAON PLAELK CTEKNEMVEKG IYO TSN FRVEPIMEV IR FPNT INGCP FBKV 
Bat SL-CoV HKU3-1 > IML Pgs IN TT SURV" EOE FPOISA AYY VGMLK STEN LMF NENG TITDAV DC SON PLAELK CTEKNFMVMKG IYOTSN FRVEPIMEV IR FPNT INGCP EBKV 
ESET IoBe SETS RRCO MBS VCECERCBN > U8 1 pas LN 1 Tami IM Cae CORE FEO SAY PVG PIL KEMTML Vr DENG TINDA TDC SODPLSELKCTBK SFAVIIKC IYOTSN FRVMDIMEV VREPNT TMACD MEV 


Human SARS-CoV GZ02 TE ESV REE SN C8 DE SV By NESE STEROLS SED CE SU ADS EVID O10 SOT TA DINE LE DIE Cy bao 
Human SARS-CoV BJO1 F PSV YAWERMIKT SN CVA DY SV BY NESTBIF STFRCYGV STK LMDL CF SNVY ADSFVV KEMDV RO TA POTGV IADYNYKLPDD AMG CV LAWN TEN OMMEIG IBA 
Human SARS-CoV Tor2 egngreeraverpcrarcva orca sag F STPRCYGVSMTKUMDL CF SNVYADSFVV KERIDVRO IA PGO TGV IADYNYKLPDD FIG CV LAWN Tita OREN IR 
Civet SARS-CoV $z3 F PSV YAWERMR TSN CVA DY SV BY NMSMEF STFRCYGV STK LIMDL CF SNVYADSFVVKEMIDVRO TA PGOTGVIADYNYK DPDD FNIGCV LAWN TN OMEING EBAY 
Bat SL-CoV Rs3367 5 CES SSUES & sae F STFRCYGVSMTKUMDL CF SNVYADSFVV KERIDVRO IA PGO TGV IADYNYKLPDD FING CV LAWN TEN CMON IRA 
NATH Pov YAWERMR TON CVA DY VEY NBSUPT STFRCYOV SMTKLMDL Cr ONVY ADSFVV KEMIDV RO 1A POO TCV IADINTRLPDD FMCCY LAWN cs 79 NE 
Bat SL-CoV Rs672 ENA SIF PNV YAWERBIKT SD CVA DY TV BV NESTE? STFRCYGV SESKLBDL CF TSVYADEFLIREMIEV ROVA P@ETGV IADYNYKLPDD FRG CV TAWN Ths OGRE f 
Bat SL-CoV Rfl EGG ESRNGES «ASIST STE NS “ESSN: - ATS OA TATSRTE ASEM: <8 —-v-- oe 
Bat SL-CoV Rm Eogsgrevranei irae ESE SS a: STS “SIIB 208-2 ok 
Bat SL-CoV Rp3 ENA TIP PNV YAWERPIK 1 SD CVA DY TVIN NMSTEF STFRCYGV SESKLBDL CF TSVYADIFLIREMEVROVA PGETGV IADYNYKLPDD FIG CV TAWN Tees) OIG BGI’ 
Bat SL-CoV HKU3-1 UG: NAAN «RISA GS ST LS Se - <A SRE ASEM: <——-: 
ELTA Tome SRC ROOM PECAN TEP PSV VAWERMIRT IN CVA Dv SV BV NESSMET STFOCYGV SBTKLMDL CF SSVYADMFVV KEMDV RO TA HMOTCV IADYNYKLPDDFRICCV TAWN Thi San Fo 


Human SARS-CoV GZ02 VR ABER EQS LEP FER D 1 SNS SaaS OBEN esi OY el Fy aie tel | ely © Pear VV VLSFELLNA PA TV CG PK BSTPL IKNO CVNENFNG LBS TG VL TESSKRFOBFOOF 
Human SARS-CoV BJO1 YR MBSR LRP FER D 1 SNBRRSB Die eee ewe LesaQiea NM Y (ed > aiid | egy O aR VV VL SF ELLNA PA TV CG PK DSTPJL IKNO CVNFNFNG LMG TC VL TEISSKRFOWFOOF 
Human SARS-CoV Tor2 VR ASER EQ LRP FER D 1 SNS Seen OBON CeeaiBe. Oey el yaaa ely O ESA VV VLSFELLNA PA TV Co PKIESTIOL IKNO CVNENFNGLBIS TGVL IBSSKRFOBFOOF 
Civet SARS-CoV SZ3 YR MBSR LRP FER D 1 SNBREB Die ome eoe men lesan | loan) Y (el > aise | ley O aR VV VL SF ELLNA PA TV CG PK DSTPJL IKNO CVNFNFENG LMG TC VL TEISSKRFOWFOOF 
Bat SL-CoV Rs3367 YR SBR LRP FER D 1 SNBRRB Die ome omaN lesahie Nia Y (el) am aifed | ley O aR VV VL SF ELLNA PA TV CG PK DSTIJL IKNO CVNFN FENG LG TC VL TEISSKRFOWFOOF 
Pa VR SEN EGA D IY s PGCGO SCSAVG ANJcYN Pi§r Pc hr TTacc = ——_$<$_$_—_—______,_____& G TGVLTSSKRFOWFOOF 
Bat SL-CoV Rs672 BRA s SRK TRS DE - - -- -- ------- IN leavesaaa pay be) FS @BoNAI I bly O BWR VV VL SF ELLNA PA TV CG PK BSTRILVKNO CVNEN FNG DigG TGV L TWISSKRFORFOOQOF 
Bat SL-CoV Rf1 BSH s SEAS © EH - - -- -- ------- Mcvr ee CVNFEN FENG iis TG VL TRISSKMIFORFO OF 
Bat SL-CoV Rml BEY sy Bk SSIES D E- - -- -- ------- IN ERMA FMW Y Be PaaS ba O WR VV VL SF ELLNA PA TV CG PK BST@ILVKNO CVNFNENG igs TG VL TWWSSKRFORIFOOF 
Bat SL-CoV Rp3 — i. __—__ een —hets—1l-sh- PSv Pa Mp eT) RIS Se - ESE Ee 
Bat SL-CoV HKU3-1 Pad SHQK Tip ees D DG - -- -- ------- IN eavaawu Lowa fe) FN Pay O WW VV VL SF ELLNA PA TV CG PKILSTIXLVKNOQ CVNFNFNG Lic TGV L TRJSSKRFORFOOF 


Bat SARS-related CoV BM48-31f8R FH cia: ROR Lrw Psce Tc sa Ec Ucy« Pia sic BT0S scHich@py RVVVLSFELLNA PA TV Co PKB STPLVANK CVNENENGIBD TCV Ll STKKFOWFOOF 


Human SARS-CoV GZ02 GR DJSD FTD SVRD PK TRB I LDISP CS FGGV SV IT PG TNASSBV AV LY ODVNCTDV SAMA IIA DO LIE PAWR LY STGRERN V FO TOAG CLIGA BHV D/SY ECDI PIGAG TCASY Ht 
Human SARS-CoV BJO01 GR DISD FTD SVRD PK TRB I LDISP CS FGGV SV IT PG TNASSBV AV LY QODVNCTDV BAMA IIA DO LIE PAWR LY STGRERN V FO TOAG CLIGA BHV D/SY ECDI PIGAG TCASY Hl 
Human SARS-CoV Tor2 GR DISD FTD SVRD PK TRB I LDISP CS FGGV SV IT PG TNASSBV AV LY ODVNCTDV SAMA IIA DO LIE PAWR LY STGREMRN V FO TOAG CLIGA BHV D/SY ECDI PIGAG ICASY Hl 
Civet SARS-CoV SZ3 IGR DISD FTD SVRD PK TijBI LDIJSP CS FGGV SV IT PG TNINSSIBV AV LY ODVNCTDV}2MiA TIBIA DO LIPPAWR LY STGREMN V FO TOAG CLIGA/BHV DWJSY ECDI PIGAG ICASYH 
Bat SL-CoV Rs3367 IGR DSD FTD SVRD PK TiBI LDIJSP CS FGGV SV IT PG TNMUSSIBV AV LY ODVNCTDV}AMA IEIA DO LIPSWR VY STGREN V FO TOAG CLIGA/BHV DJSY ECDI PIGAG ICASYH 
Bat SL-CoV RsSHC014 GR DSD FTD SVRD PK TRB I LDIJSP CS FGGV SV IT PG TNMISSIBV AV LY ODVNCTDV}AMA IEIA DO LIDPSWR VY STGREN V FO TOAG CLIGA/BHV DJSY ECDI PIGAG ICASYH 
Bat SL-CoV Rs672 GR DJSD FTD SVRD PO THO I LD I¥P CS FG GV SV IT PG TNNSSBV AV LY ODVNCTDV/aMAA IMMA DO LID PAWR VY STGMAMRN V FO TOAG CLIGA BHVNASY ECDI PIGAG ICASY Ht 
Bat SL-CoV Rfl GROMSDFTDSVRD PO MET LDISPCSFGGV SV ITPCTNESSMV AV LY ODVNCTDV EMM IMA DOLMPGWR VY NIGEEMEEMV FOTOAGCLIGABHUNMSY OCDIPIGAG ICASY RI 
Bat SL-CoV Rml GR DMSDFTDSVRDPO IMEI LDISPCSFGGV SV ITPCINNSSEVAV LY ODVNCTDV EMS INA DOLIBPAWR VY SIGHMMENV FOTOAGCLIGABHUNMSY ECDI PIGAG ICABY Bl 
Bat SL-CoV Rp3 GR DAISD FTD SVRD PO THIET LDIISP CS FGGV SV IT PG TNASSBV AV LY OQ DVNCTDVIZRA IIA DO LIEPAWR VY SIGH V FO TOAGCLIGABHVNASY ECDIPIGAG TCASY Hl 
Bat SL-CoV HKU3-1 GR DAJSD FTD SVRD PO THJELLDISP CS FGGV SV ITPGTNASS EV AV LY ODVNCTDV MMA TA DO LIE PAWR VY STGMBRRNV FO TOA G CLIGABHVNASY ECDIPIGAG ICASY Bt 
TRCN coe Be Bolo mo \ ME =)UC SB mack DY SD PTD SVRD PK TET LDIAPCSYGGV SV ITPGTNASSBV AV LY ODVNCTDV BMALIIA DO TSHSBWR VY ARBSRBBERN I FO TOAGCLIGAMY WINE SY ECDIPIGAG ICA MY 
Extended Data Figure 2 | Alignment of CoV S protein $1 sequences. SZ3 was identified from P. larvata in 2003 collected in Guangdong, China. 
Alignment of $1 sequences (amino acids 1-660) of the two novel batSL-CoVS_ — SL-CoV Rp3, Rs 672 and HKU3-1 were identified from R. sinicus collected in 
proteins with those of previously reported bat SL-CoVs and human and Guangxi, Guizhou and Hong Kong, China, respectively. Rfl and Rm1 were 
civet SARS-CoVs. The newly discovered bat SL-CoVs are boxed in red. identified from R. ferrumequinum and R. macrotis, respectively, collected in 
SARS-CoV GZ02, BJO1 and Tor2 were isolated from patients in the early, Hubei Province, China. Bat SARS-related CoV BM48-31 was identified from 


middle and late phase, respectively, of the SARS outbreak in 2003. SARS-CoV _ R. blasii collected in Bulgaria. 
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Extended Data Figure 3 | Complete RdRP sequence phylogeny. 


Phylogenetic tree of bat SL-CoVs and SARS-CoVs on the basis of complete 
RdRP sequences (2,796 nucleotides). Bat SL-CoVs RsSHC014 and Rs3367 are 


highlighted by filled circles. Three established coronaivirus genera, 


Alphacoronavirus, Betacoronavirus and Gammacoronavirus are marked as , 
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B-CoVs B 


B-CoVs D 
B-CoVs C 


B-CoVs A 


a-CoVs 


+ v-CoVs 


and y, respectively. Four CoV groups in the genus Betacoronavirus are 
indicated as A, B, C and D, respectively. MHV, murine hepatitis virus; 
PHEV, porcine haemagglutinating encephalomyelitis virus; PRCV, porcine 
respiratory coronavirus; FIPV, feline infectious peritonitis virus; IBV, 
infectious bronchitis coronavirus; BW, beluga whale coronavirus. 
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Extended Data Figure 4 | Sequence phylogeny of the complete S protein of 


Bat SL-CoVs RsSHC014 and Rs3367 are highlighted by filled circles. Bat CoV 
SL-CoVs and SARS-CoV. Phylogenetic tree of bat SL-CoVs and SARS-CoVs | HKU9 was used as an outgroup. 
on the basis of complete S protein sequences (1,256 amino acids). 
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Extended Data Figure 5 | Detection of potential recombination events. N (nucleotides 28,685) genes, respectively. Both analyses were performed with 
a, b, Similarity plot (a) and bootscan analysis (b) detected three recombination an F84 distance model, a window size of 1,500 base pairs and a step size of 


breakpoints in the bat SL-CoV Rs3367 or SHC014 genome. The three 300 base pairs. 
breakpoints were located at the ORF1b (nt 20,827), M (nucleotides 26,553) and 
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Extended Data Table 1 | Summary of sampling detail and CoV prevalence 


Sampling time 


April, 2011 
October, 2011 
May, 2012 


September, 2012 


Total number of swab or fecal samples 
collected 

14 

10 

54 

39 


Number of CoV PCR positive samples (%) 


1 (7.1) 
3 (30) 
4 (7.4) 
19 (48.7) 
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Extended Data Table 2 | Genomic sequence identities of bat SL-CoVs with SARS-CoVs 


Pairwise genomic nucleotide acids identity (%) 


Human and civet 
Bat SARS-Like CoVs 


SARS-CoVs 

Genome 
SHC014. Rs672 Rp3 Rfi Rmi HKU3-1 BM48-31 | GZ02 BJ01 Tor2 SZ3 

CoVs size (nt) 
3367 29,787 98.8 92.5 93.2 87.3 88.0 87.8 76.9 95.4 953 954 95.3 
SHC014 29,787 - 92.6 932 687.3 88.1 87.8 77.0 95.2 95.1 95.1 95.1 
Rs672 29,059 - 924 86.2 87.4 87.0 75.2 90.9 909 908 91.0 
Rp3 29,736 - 88.3 90.3 89.6 77.0 92.1 920 92.1 92.0 
Rf1 29,709 - 89.4 88.4 76.6 87.2 87.1 S72 oy. 
Rm1 29,749 - 90.1 76.4 87.6 S75 e875 87.5 
HKU3-1 29,728 - 76.8 87.4 s73 8/74 ‘87.3 
BM48-31 29,276 - 76.9 771 67740. F6S 
GZ02 29,760 - 996 996 99.7 
BJ01 29,725 - 99.8 99.6 
Tor2 29,751 - 99.5 

SZ3 29,741 7 
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Extended Data Table 3 | Genomic annotation and comparison of bat SL-CoV Rs3367 with human/civet SARS-CoVs and other bat SL-CoVs 


ORF Identity nt/aa (%) 


Human and civet SARS-CoVs Bat SARS-like CoVs 
Gee wvet - Giacad Buot Tor2 $Z3 Rs672 Rp3 Rft Rm1 HKU3-1 BM48-31 
Start-End (nt. Nt. Aa. TRS 
Pla 265-13,398 13,134 4,377 ACGAAC AUG 96.7/97.9 96.6/97.9  96.8/97.9 96.8/98.1 } 93.3/94.2  95.5/96.9 88.1/94.0 87.9/93.3 87.9/94.2 76.3/80.8 
P1b 13,398-21,485 8,088 2,695 96.3/99.2 96.3/99.2 96.3/99.2 96.3/99.2 | 97.2/99.2  97.2/99.2 90.6/98.4 91.0/98.7 90.7/98.5 83.4/93.7 
s 21,492-25,262 3,771 1,256 ACGAACAUG _ 88.3/90.1 88.2/90.0 88.1/89.8 88.2/90.0 | 76.5/78.2 76.0/79.1 74.0/77.4 76.3/79.1  75.6/78.2 70.2/74.5 
($1) * 21,493-23,535 2,043 681 78.2/81.1 78.2/80.9 78.1/80.6 78.2/81.1 | 65.1/62.2  63.9/63.0 62.9/62.5 64.7/63.3  65.2/63.4 62.2/64.7 
(S2) * 23,536-25,263 1,728 575 98.1/99.3 98.1/99.3 98.1/99.3 98.1/99.1 | 87.9/94.8  88.1/95.8 85.1/92.7 87.9/95.4 86/93.5 76.6/88.2 
ORF3a 25,271-26,095 825 274 ACGAAC AUG = 99.2/98.1  98.6/97.0  98.7/97.0  98.5/96.7 | 90.4/90.8  84.1/84.3 88.8/86.8 83.6/84.3 83.1/82.4 72.1/71.2 
ORF3b 25,692-26,036 345 114 99.1/99.1 98.2/98.2  98.2/98.2 97.9/97.3 | 99.1/98.2 N/D 82.6/92.1  N/D N/D N/D 
E 26,120-26,350 231 76 ACGAAC AUG = 98.7/98.6 98.7/98.6 98.7/98.6 98.7/98.6 | 99.1/98.6  97.8/98.6 96.5/96.0 96.1/97.3 97.4/98.6 91.3/93.4 
M 26,401-27,066 666 221 ACGAAC AUG = 97.4/98.1 97.2/98.1  97.2/98.1  97.2/97.7 | 98.7/99.5  93.3/98.1 96.3/98.6 93.2/95.4  93.9/96.8 78.5/88.1 
ORF6 27,077-27,268 192 63 ACGAAC AUG  97.3/95.2 96.8/93.6 97.3/95.2  97.3/95.2 | 97.3/96.8  95.8/92.0 94.2/92.0 95.3/92 94.7/90.4 63.5/49.2 
ORF7a 27,276-27,644 369 122 ACGAACAUG ~~ 94.5/95.9  94.5/95.9 94.5/95.9 94.5/95.9 | 97.8/100  96.2/99.1 92.9/95.0 93.4/97.5  93.2/97.5 62.3/58.1 
ORF7b 27,641-27,776 135 44 96.2/93.1 96.2/93.1  96.2/93.1 96.2/93.1 | 99.2/100  99.2/100 97.7/97.7 99.2/100  93.3/95.4 62.9/63.6 
ORF8 27,782-28,147 366 121 ACGAACAUG = 47.1/46.3 N/A N/A 47.1/46.3 | 97.8/100  85.2/90.2 46.2/39.0 85.7/90.2 85.7/85.3 N/A 
N 28,162-29,430 1,269 422 ACGAAC AUG 98.3/99.5  98.4/99.5 98.4/99.5 98.4/99.5 | 98/98.5  96.6/97.6 93.7/95.2 96.2/97.1 95.9/96.2 77.9/87.2 
s2m 29,628-29,668 41 97.5 97.5 97.5 97.5 100 100 100 100 100 95.1 


*S1, the N-terminal domain of the coronavirus S protein responsible for receptor binding. S2, the S protein C-terminal domain responsible for membrane fusion. 


The ORFs in the genome were predicted and potential protein sequences were translated. The pairwise comparisons were conducted for all ORFs at nucleotide acids (nt) and amino acids (aa) levels. The s2m were compared at nt 
level. TRS: Transcription regulating-sequences; N/D, not done; N/A, not available. 
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Extended Data Table 4 | Genomic annotation and comparison of bat SL-CoV RSSHCO14 with human/civet SARS-CoVs and other bat SL-CoVs 


ORF Identity nt/aa (%) 


Human and civet SARS-CoVs Bat SARS-like CoVs 
eee cae fame = Buot Tor2 $Z3 Rs672 Rp3 Rft Rm1 HKU3-1 BM48-31 
Start-End (nt. Nt. Aa. TRS 
Pla 265-13,398 13,134 4,377 ACGAAC AUG 96.7/97.9 96.6/97.9 96.8/97.9 96.8/98.1 ]| 93.3/94.2  95.5/96.9 88.1/94.0 87.9/93.3 87.9/94.2 76.3/80.8 
P1b 13,398-21,485 8,088 2,695 96.3/99.2 96.3/99.2  96.3/99.2 96.3/99.2 | 97.2/99.2  97.2/99.2 90.6/98.4 91.0/98.7  90.7/98.5 83.4/93.7 
s 21,492-25,262 3,771 1,256 ACGAACAUG __ 88.3/90.1 88.2/90.0 88.1/89.8 88.2/90.0 | 76.5/78.2 76.0/79.1 74.0/77.4 76.3/79.1  75.6/78.2 70.2/74.5 
($1) * 21,493-23,535 2,043 681 78.2/81.1 78.2/80.9 78.1/80.6 78.2/81.1 | 65.1/62.2  63.9/63.0 62.9/62.5 64.7/63.3 65.2/63.4 62.2/64.7 
($2) * 23,536-25,263 1,728 575 98.1/99.3 98.1/99.3 98.1/99.3 98.1/99.1 | 87.9/94.8  88.1/95.8 85.1/92.7 87.9/95.4 86/93.5 76.6/88.2 
ORF3a 25,271-26,095 825 274 ACGAAC AUG = 99.2/98.1  98.6/97.0  98.7/97.0 98.5/96.7 | 90.4/90.8  84.1/84.3 88.8/86.8 83.6/84.3 83.1/82.4 72.1/71.2 
ORF3b 25,692-26,036 345 114 99.1/99.1 98.2/98.2 98.2/98.2 97.9/97.3 | 99.1/98.2 N/D 82.6/92.1  N/D N/D N/D 
E 26,120-26,350 231 76 ACGAAC AUG = 98.7/98.6 98.7/98.6 98.7/98.6 98.7/98.6 | 99.1/98.6  97.8/98.6 96.5/96.0 96.1/97.3 97.4/98.6 91.3/93.4 
M 26,401-27,066 666 221 ACGAAC AUG = 97.4/98.1  97.2/98.1 97.2/98.1  97.2/97.7 | 98.7/99.5  93.3/98.1 96.3/98.6 93.2/95.4 93.9/96.8 78.5/88.1 
ORF6 27,077-27,268 192 63 ACGAAC AUG  97.3/95.2 96.8/93.6 97.3/95.2 97.3/95.2 | 97.3/96.8  95.8/92.0 94.2/92.0 95.3/92 94.7/90.4 63.5/49.2 
ORF7a 27,276-27,644 369 122 ACGAACAUG ~~ 94.5/95.9  94.5/95.9 94.5/95.9 94.5/95.9 | 97.8/100  96.2/99.1 92.9/95.0 93.4/97.5  93.2/97.5 62.3/58.1 
ORF7b 27,641-27,776 135 44 96.2/93.1 96.2/93.1  96.2/93.1 96.2/93.1 | 99.2/100  99.2/100 97.7/97.7 99.2/100  93.3/95.4 62.9/63.6 
ORF8 27,782-28,147 366 121 ACGAACAUG = 47.1/46.3 N/A N/A 47.1/46.3 | 97.8/100  85.2/90.2 46.2/39.0 85.7/90.2 —85.7/85.3 N/A 
N 28,162-29,430 1,269 422 ACGAAC AUG — 98.3/99.5  98.4/99.5 98.4/99.5 98.4/99.5 | 98/98.5  96.6/97.6 93.7/95.2 96.2/97.1 95.9/96.2 77 .9/87.2 
s2m 29,628-29,668 41 97.5 97.5 97.5 97.5 100 100 100 100 100 95.1 


*S1, the N-terminal domain of the coronavirus S protein responsible for receptor binding. S2, the S protein C-terminal domain responsible for membrane fusion. 


The ORFs in the genome were predicted and potential protein sequences were translated. The pairwise comparisons were conducted for all ORFs at nucleotide acids (nt) and amino acids (aa) levels. The s2m were compared at nt 
level. TRS: Transcription regulating-sequences; N/D, not done; N/A, not available. 
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Extended Data Table 5 | Cell lines used for virus isolation and susceptibility tests 


Cell lines Species (organ) origin ‘wai | Infectivity 
293T Human (kidney) 
DMEM+10%FBS 
PK15 Pig (kidney) 
BHK21 Hamster (kidney) 


ae Myotis davidii (kidney) RPMI1640+10%FBS 


RSKT Rhinolophus sinicus (kidney) 


MCKT Myotis chinensis (kidney) 
DMEM/F12+10%FBS 
Pteropus alecto (kidney) 
Rousettus leschenaulti (kidney) 


* Infectivity was determined by the presence of viral antigen detected by immunofluorescence assay. 


A549 Human (alveolar basal epithelial) ie 
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